Microblog Retrieval for Disaster Relief: How To Create Ground Truths?
نویسندگان
چکیده
Microblogging services like Twitter are an important source of real-time information during disasters and can be utilized to aid rescue, relief and rehabilitation efforts. The focus of this work is on the creation of gold standard data for automatic retrieval of helpful tweets. Using various experiments on the gold standard data prepared in the FIRE 2016 Microblog Track [3], we show that the gold standard data prepared in [3] missed many relevant tweets. We also demonstrate that using a machine learning model can help in retrieving the remaining relevant tweets by training an SVM model on a subset of the data and using it to get the most useful tweets in the entire dataset. We obtain high precision and recall even with very little training data, which makes such a model suitable for use in a real-time disaster situation.
منابع مشابه
Microblog Retrieval for Post-Disaster Relief: Applying and Comparing Neural IR Models
Microblogging sites like Twier are important sources of real-time information on ongoing events, such as socio-political events, disaster events, and so on. Hence, reliable methodologies for microblog retrieval are needed for various applications. In this work, we experiment with microblog retrieval techniques for a particular application – identifying tweets that inform about resource needs a...
متن کاملBITS_PILANI@IMRiDis-FIRE 2017: Information Retrieval from Microblog during Disasters
Microblogging sites like Twitter are increasingly being used for aiding relief operations during disaster events. In such situations, identifying actionable information like needs and availabilities of various types of resources is critical for effective coordination of post disaster relief operations. However, such critical information is usually submerged within a lot of conversational conten...
متن کاملOverview of the FIRE 2016 Microblog track: Information Extraction from Microblogs Posted during Disasters
The FIRE 2016 Microblog track focused on retrieval of microblogs (tweets posted on Twitter) during disaster events. A collection of about 50,000 microblogs posted during a recent disaster event was made available to the participants, along with a set of seven practical information needs during a disaster situation. The task was to retrieve microblogs relevant to these needs. 10 teams participat...
متن کاملMicroblog Retrieval in a Disaster Situation: A New Test Collection for Evaluation
Microblogging sites are important sources of situational information during disaster situations. Hence it is important to design and evaluate Information Retrieval (IR) systems that retrieve information from microblogs during disaster situations. The primary contribution of this paper is to develop a test collection for evaluating IR systems for microblog retrieval in disaster situations. The c...
متن کاملAn Information Retrieval System for FIRE 2016 Microblog Track
This paper describes our approaches to FIRE (Forum for Information Retrieval Evaluation) 2016 Microblog track. The main aim of this track was to develop an information retrieval system that can identify relevant tweets posted during a disaster event. The relevance is measured with respect to some predefined topics provide by the track organizers. In this working note we have given the descripti...
متن کامل